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Abstract. We study the secondary structure of RNA determined by Watson- 
Crick pairing without pseudo-knots using Milnor invariants of links. We focus 
on the first non-trivial invariant, which we call the Heiscnberg invariant. The 
Heisenberg invariant, which is an integer, can be interpreted in terms of the 
Heisenberg group as well as in terms of lattice paths. 

We show that the Heisenberg invariant gives a lower bound on the number 
of unpaired bases in an RNA secondary structure. We also show that the 
Heisenberg invariant can predict allosteric structures for RNA. Namely, if the 
Heisenberg invariant is large, then there are widely separated local maxima 
(i.e., allosteric structures) for the number of Watson-Crick pairs found. 



1. Introduction 

Ribonucleic acid (RNA) is a nucleic acid polymer consisting of nucleotide monomers, 
each of which is of one of four types determined by the nucleotide base present in it. 
RNA plays a central role in living cells, specifically in the synthesis of proteins using 
DNA as a template. In addition, RNA itself can serve as an information carrier 
and also has catalytic properties. Indeed its ability to serve as both an information 
carrier and a catalyst has led to speculation that life began as an RNA world |12j . 

The primary structure of an RNA molecule is the sequence of nucleotide bases 
in it. In addition to this, the properties of an RNA molecule depend strongly on 
the secondary structure, which is the 3-dimensional shape of the molecule |13j . 

The bases form two complementary pairs, with members of a pair forming strong 
hydrogen bonds. Thus, if two subsequences of the RNA sequence are complemen- 
tary, a stable secondary structure called a stem loop structure can be formed by 
bonds between complementary pairs in these subsequences. We study here these 
secondary structures of an RNA molecule, determined by the Watson-Crick pairing. 
We shall consider structures without so called pseudo-knots. For basic concepts for 
RNA folding, we refer to and [T^ • For surveys and other studies of RNA fold- 
ing, we refer to |12|-[28j. Henceforth by secondary structure we mean the secondary 
structure determined by Watson-Crick pairing without pseudo- knots. 

We focus here on introducing new methods to yield a conceptual understanding 
of RNA folding. The model we consider - Watson-Crick pairing without pseudo- 
knots, is clearly an approximation in various ways. Firstly, stereo-chemical forces 
do not allow very short loops. Secondly, pseudo-knots are present in nature. Thus, 
allowing short loops but not pseudo-knots is an approximation of stereo-chemical 
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forces. Further, we do not take into account the difference between the strengths 
of the A-U and the G-C bonds, and also ignore the non- Watson-Crick bonds. 

Thus, our model is clearly not appropriate for the computational study of in- 
dividual RNA molecules. Our goal is rather to introduce new methods for under- 
standing RNA secondary structure, and show that these are very powerful in the 
context of our model. One can extend these methods to take into account more 
realistic models of RNA secondary structure. 

Our methods can be motivated by a simple observation - if, for example, there 
are more Adenine than Uracil bases, then some Adenine bases must be unpaired. 
These considerations give a very elementary lower bound on the number of unpaired 
bases. Mathematically, this can be viewed as coming from abelianisation. Another 
context in which abelianisation gives the simplest criteria is the linking number for 
classical links. In this paper, we show that one can adapt Milnor's theory of higher 
linking numbers to the context of RNA. 

We use a natural model for Watson-Crick pairing of RNA (without pseudo- 
knots) in terms of the free group F on two generators a and (3. An element of 
the free group is given by a word in the four letters a, a, P and /3, with a and /3 
the inverses of a and /3 respectively. We identify these letters with the nucleotides 
Adenine, Uracil, Guanine and Cytosine respectively. Under this identification, an 
RNA molecule gives a string in the four letters a, /?, a and f3. 

Stem loops, which are the basic units of RNA secondary structure, then corre- 
spond to words in the free group of the form gig, with g and I words in the free 
group and g the inverse of g (see figure [1]). One may further have Watson-Crick 
pairing within the word I, so that a subword of the RNA sequence may be of the 
form ga{hlh)bg as in figure O In general, a secondary structure without pseudo- 
knots consists of pairings between letters and their inverses so that there is no 
nesting. Wc formalise such a structure (which we call a folding) in Definition 15.11 
The appropriate energy, whose local and global minima we study, is the number of 
unpaired bases in the secondary structure. 

Mathematically, one can interpret the above as saying that the number of un- 
paired bases is a conjugacy invariant norm. For, a secondary structure on RNA 
corresponding to the word w gives one for the word gwg (with the initial segment 
corresponding to g paired with the final segment corresponding to g) so that the 
number of unpaired bases is the same. Further, given secondary structures on 
strands of RNA corresponding to words wi and W2 , we get a secondary structure 
on the strand corresponding to wiW2 (the concatentation of the words). This paper 
is based on the observation that conjugacy invariance can be taken into account by 
considering nilpotcnt quotients. 

In analogy with Milnor invariants, we shall associate numbers to words g in the 
letters a, (3, a and /3, which wc call invariants according to terminology familiar 
in mathematics (they are not to be taken as invariant in any biological sense). By 
the above, it is desirable that they are invariant under conjugation, or at least the 
extent to which they are not invariant can be estimated. To achieve sub-additivity, 
we consider additive functions of g and take their absolute value, or more generally 
sums of the absolute values of such functions. 

As we have seen, an obvious lower bound for the number of unpaired elements 
is given by comparing the number of letters that are a with the number that arc 
a and similarly for /3 and /3. Wc denote the difference between the number of 
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letters of a word g that are a and the number that are a by a{g). Similarly, h{g) 
denotes the difference between the number of letters that are (3 and the number 
that are (3. In other words we look at the image of g under the abelianisation 
map ab : F ^ I? . Then a{g) and h{g) denote the co-ordinates of ah{g) so that 
ab{g) = a{g)ab{a) + b{g)ah{(3). The numbers a{g) and h{g) are the first of our 
invariants. 

Clearly, the minimum number of unpaired bases in g is at least \a{g)\ + 1^(5)1, 
but this is very far from sharp. We proceed further in analogy with Milnor's theory 
of link homotopy. The simplest invariants of a link are the linking numbers. These 
are given by considering the image of a curve in the abelianisation of an appropriate 
fundamental group, and are thus analogous to a{g) and h{g). Milnor constructed 
higher linking numbers by considering appropriate nilpotent quotients. 

We shall associate an appropriate link to RNA molecules in Section (TU] and 
construct various invariants. The main focus in this paper, however, is to construct 
and study the first of these higher invariants. It is easiest to proceed with a direct 
algebraic description. This description is in terms of another familiar object - the 
Heisenberg group. We thus call this invariant the Heisenberg invariant i'{g). There 
is also a nice geometric view of this invariant in terms of areas enclosed by lattice 
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paths. We also provide an elementary combinatorial description in Theorem 16.11 
which allows for easy computation. 

We show that the Heisenberg invariant gives a lower bound on the number of 
unpaired bases in an RNA molecule, and hence the potential energy of a secondary 
structure. 

We also show that the Heisenberg invariant is related to a biologically significant 
property of Watson-Crick pairing. Namely, wc show that if v^g) is sufficiently large, 
then there are allosteric structures, i.e., local minima for the number of unpaired 
bases that are widely separated. This means that there are two ways of folding 
the sequence so that we cannot pass from one to the other without significantly 
increasing the number of unpaired bases (here a folding is an abstraction of the 
secondary structure). Thus, the Heisenberg invariant as well as the higher invariants 
should prove very fruitful in the study of RNA secondary structures. 

Our results depend on our simplified model. However, it is easy to see that 
similar results continue to hold even if one makes the model biologically more 
realistic by taking into account that nearby bases do not pair (i.e., there arc no very 
short loops). In the case of the lower bound this is obvious, as further restrictions 
can only increase the number of unpaired bases. The result concerning allosteric 
structures also extends as we sketch following the proof of the result for our model. 

2. The Heisenberg invariant 

The Milnor higher link invariants [9] [10] are based on the lower central series of 
a group. We recall some basic definitions. 

Consider a group G. For elements a,b & G, a denotes the inverse of a and [a, b] 
denotes the commutator abah. For subgroups Hi,H2 C G, we define [Hi,H2] to be 
the normal subgroup generated by elements of the form [a,b], a (1 Hi, b Q H2. 

The lower central series of G is defined inductively as follows. Let Gi = G. If 
Gn has been defined, we define Gn+i to be [G, G„]. In particular, G2 = [G, G] and 
the abelianisation of G is G/G2. Note that Gi D G2 D G3 D . . . . 

Consider now the free group F generated by a and p. Let H be the group 
F/F3. The Heisenberg invariant is obtained by considering the image [g] G H of 
an element g G F. It is well known that this is the Heisenberg group, which is the 
unique central extension of by Z. For the convenience of the reader, we prove 
these properties below. 

Proposition 2.1. There is an exact sequence 

(1) l^Z^i7^Z2->l 

with the image of Z central in H and generated by [a, f3] . Further, H is isomorphic 
to the Heisenberg group. 

Proof. As F3 C F2, the abelianisation homomorphism ab : F ^1? = F/F2 induces 
a surjective homomorphism Lp : H = F j F-j, ^ F j F2 = 'i? ■ It is well known that 
the kernel of a6 : F ^ Z^ is the normal subgroup in F generated by [a, Hence 
the kernel of Lp is the normal subgroup in H generated by the equivalence class of 
\a, /?] in H, which we continue to denote by \a, /?]. 

Note that the commutators [[a,/?], a] and [[a, /?],/?] arc both elements of F3, 
and hence have trivial images in H. It follows that [a, /?] € commutes with 
the images of a and P in H and hence is central. Thus, as the kernel of ip is the 
normal subgroup generated by (p, it is in fact the cyclic group generated by [a,P]. 
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To complete the proof of the exact sequence of Equation [U it suffices to show that 
no power of [a, /3] is trivial in the group H. 

Wc show this by constructing an explicit homomorphism from H to the Heisen- 
berg group H with integer coefficients, namely the group of matrices of the form 

(lac 
1 b 
1 

with a, b and c integers. 

The homomorphism ip : H ^ M is defined as follows. Let ^E* : — > M be the 
unique homomorphism taking a and /3 to the matrices Af(f,0,0) and M(0,f,0). 
By a well known (and straightforward) computations, '^{[a,f3]) — M(0,0, 1), and 
Af (0,0,1) is central in H. Hence ^'(Fa) is the trivial group. Thus, we get a 
well-defined homomorphism -0: H — F/F^ ^ M, which is clearly surjective. As 
ipiia^P]) =M(0, 0,1), it follows that V'([a,/?]'=) = M(0,0,fc). Hence if fc 7^0, [a, 13]'' 
is non-trivial as an clement of H. This shows that the sequence of Equation [T] is 
exact. 

Finally, we show that ip is injective, hence an isomorphism. Suppose tpig) = 1- 
Then as ip gives an isomorphism on the abelianisations of H and H, it follows that 
ab{g) = 0. Hence by the exact sequence, g = [a, b]'' for some k E Z. It follows that 
ipig) = M(G,0, fc), hence ip{g) = 1 =^ k = =^ g = 1. □ 

The homomorphisms a: F ^ Z and b: F ^ Z defined on F factor through H 
(as Fs C F2), and we continue to denote them by a(-) and b{-). We shall also denote 
the images of a and f3 in H hy a and /3. 

It is easy to see that for an element g € H, a{g) and b{g) are the entries a and b 
of V'(ff)- We can define the Heisenberg invariant in terms of the remaining entry c. 
However, it will be convenient to take a different approach based on the following 
proposition (which is a special case of normal forms that are well known in the 
literature). 

Proposition 2.2. Any element g (z H can be uniquely expressed as 
with V , a and b integers. 

Proof. Let h = g(3-'>'-9)a-'''-aK Then the image under the abelianisation map ip{h) 
of h is trivial. Hence h is in the kernel of (p. By Proposition 12. li h ~ [a, b]'^ for a 
unique v. Hence it follows that g = [a,PYa°'P'' with a = a{g) and b = b{g). 

To see uniqueness, observe by abelianising that \i g = [a, (3Ya°'ff' , we must have 
a = a{g) and b = b{g). Further, li h = gl3~^a~°' , then v is the unique integer such 
that h ^ [a, 6]"^, and hence is determined by g. □ 

Definition 2.3. The Heisenberg invariant of g G H is the unique v such that 
g = [a, (3Y a'^'' (3^ with a and b integers. The Heisenberg invariant of a word in F is 
the Heisenberg invariant of the image of the word in H . 

In terms of the above definition, the representation of Proposition 12.21 can be 
expressed as 

(3) 5 = a"(s)/3''(»)[a,^]"(s) 
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The higher hnk invariants are defined only for Unks with trivial linking number. In 
our situation, we have defined the Heisenberg invariant for all words. Nevertheless, 
it should be regarded as well-defined up to an error given by the abelianisation 
{a{g), b{g)). For instance, in Proposition [SHI we can interchange the order of (3 and 
a in the representation of g. An analogous result still holds but we get a different 
value of the Heisenberg invariant. 

Wc have the following simple properties of the Heisenberg invariant. In what 
follows all equalities are to be understood to be in the group H. 

Proposition 2.4. If gi,g2 G H have trivial ahelianisations, 1^(5132) = ^^(31)^^(32)- 

Proof. As gi and 32 have trivial abelianisation, by Proposition 12.11 q, = [a,/?]'"', 
j = 1,2, for some integers ki. By definition, v{gi) = h. Further, 5152 = [a, Z^]*^^^*^^, 
hence 1^(51.92) = fci + fc2 = '^{gi) + '^(92)- □ 

Proposition 2.5. If g Cz H has trivial abelianisation, then for h Cz H , ^{hgh) = 

Proof. By Proposition l2.11 as g is in the kernel of the abelianisation homomorphism 
Lp, g is central. Thus, hgh = g □ 

For the sake of clarity, we shall focus on elements g € H with trivial abeliani- 
sation. However, it is easy to obtain variants of all our results allowing for errors 
determined by a{g) and b{g) using the following proposition. 

Proposition 2.6. For k e Z, the following identities hold. 

(1) i^(a'^g) = v{g) 

(2) i.(.g/3^) ^ v{g) 

Proof. Asg = [a, P]'''^^) a'^ig) pbig) j^,^^] jg central, a'^g = [a, (3]''^3) a'^+'^^a) pb{g) ^ 
hence i^{a''g) = v{g). Similarly, gl3^ = [a, PY^s) o,<9) j3^i9)+k ^ j^g^^^g ^.^^^fc) ^ 
y{g). □ 

We shall say that the word g is balanced if a{g) = b{g) ~ 0. 

3. Lattice paths and Area 

In this section, wc give a geometric interpretation of the Heisenberg invariant, 
and some obvious extensions that follow from this. Examples viewed in this fashion 
form the intuition for the rest of the paper. As the results here are not used formally 
elsewhere, we shall skip most proofs. 

The plane contains the lattice Z^. We can associate to each word g = I1I2 ■ ■ .In 
in the letters a, a, (3 and P a path in the plane as follows. We start at the origin 
(0, 0). In the first step, we take a path from (0, 0) to one of the points (1, 0), (0, 1), 
(—1,0) and (0, —1) according as h is a, /3, a or p. Inductively, at the end of the 
(fc — l)th step we will have a path from (0,0) to a lattice point (p, g) G Z^. We 
extend the path by a unit segment joining (p, g) to one of the points (p, q) + (1, 0), 
{Pi q) + (0, 1), (p, q) + (-1, 0) and {p, q) + (0,-1) according as Ik is a, /3, a or /3. 

Thus, we obtain a path in the plane consisting of horizontal and vertical seg- 
ments. If a{g) = b{g) — 0, this path is a loop 7. The Heisenberg invariant is the 
oriented area, interpreted appropriately, bounded by the loops 7. If 7 is a simple 
loop, it bounds a region R. In this case, v{g) is ±Area{R), with the sign deter- 
mined by whether 7 is a counterclockwise or a clockwise loop around R. In general. 
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we can regard 7 as the boundary of a region R, allowing signs and multiplicities, 
which we define below in terms of winding numbers. The area of R, taking into 
account the signs and multiplicities, is the Heisenberg invariant i^ig). 

Consider a unit square A(p, q), p^q ^ Z^, with vertices {p,q), {p+l,q), (p, q + 1) 
and (p + 1,9 + !)• Let c{p,q) be the winding number of 7 about the centre z of 
A{p, q). We remark that we can take the winding number about any interior point 
to get the same result. The integer c(p, q) is the (possibly negative) multiplicity of 
the square A(p, q). 

Note that only finitely many of these coefficients are non-zero. We can interpret 
the Heisenberg invariant as (the finite sum) 

In these terms, there are obvious extensions of the Heisenberg invariant. The 
group acts on the plane by translations. We use multiplicative notation for 1? 
and denote generators by s and t, so that the action of s is translation by (1,0) 
and that of t is translation by (0, 1). Let A = A(0,0). Then A{p,q) is the image 
s^f^A of A. The region R can be expressed as the formal sum 

J2 c{p,q)sn'^A 

Thus, we associate to g the polynomial in two variables 
P,(s,t)- ^ cip,q)sH'^ 

The Heisenberg invariant is Pg{l,l). Extensions are given by other (linear) 
functions of the polynomial Pg. As mentioned in the introduction, it is natural to 
consider such functions F{Pg) so that if g is conjugate to g\ then F{Pg) = F{Pg'). 
This translates to being invariant under multiplication by the polynomial s*''^' for 
k,l G Z, i.e., for a polynomial P, F{P) = F{s^t^P). It is clear that the Heisenberg 
invariant Pg(l,l) has this property. We next show how to construct secondary 
invariants, i.e., which are invariant under conjugation provided the Heisenberg in- 
variant vanishes. 

Let Pg{s) = Pg{s, 1). Then Pg{s) is invariant under multiplication by any power 
of t. Let Vsig) — Pg(l)- Suppose ^{g) vanishes. Then for an element g' conjugate 
to g, Pg'{s,t) is of the form s'^t'-Pg{s,t). Hence, 

P'g,{l) ^ iS%ni) = iks'^~'pg)il) + iS%)il)=p'g{l) 

as Pg{l) ~ i^{g) = by hypothesis. 

We have a similar invariant taking t in place of s. In case these invariants vanish, 
we get further invariants by taking higher derivatives. 

We next turn to the general case, where we do not necessarily have a{g) — b{g) = 
0. As before, we get a path from (0, 0) to {a{g), b{g)). We make this into a loop 7 by 
extending this by the vertical segment to {a{g), 0) and then the horizontal segment 
to the origin. The Heisenberg invariant is then the area enclosed by this loop. 
Note that there are other minimal paths joining {a{g),b{g)) to the origin, which 
give different values for the Heisenberg invariant. Hence v{g) should be regarded 
as defined up to indeterminacy given by a{g) and b{g). 
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We give another interpretation of the above in homological terms. This will not 
be used in the sequel. 

The plane has a natural structure as a cell complex X with vertices lattice 
points, edges horizontal or vertical unit segments joining adjacent lattice points and 
faces unit squares. The group acts on this cell complex freely by translations. 
The quotient Y of the one-skeleton X'^' under this action is the wedge of two 
circles. This has fundamental group F, and the one-skeleton of X is the (Galois) 
cover corresponding to the subgroup F2 ~ F\ . 

Any word g in the free group gives a path in Y . This lifts to a path in 7 starting 
at the origin which can be regarded as a 1-chain in C\(X\ If a{g) = 6(17) = 0, then 
7 is a loop as g & F2, and hence is a 1-cycle. As the plane is contractible, this is a 
boundary 7 = dC,, C '= C2{X). As H2{X) and C3(X) are trivial, it follows that C is 
unique. 

Let A be a fixed unit square. Any other unit square is the image g/S. of A under 
the action of on C2(A"), and g is unique. As the unit squares are a basis of 
C2{X), we can uniquely express C as a finite sum. 



This has obvious extensions. We note that we can associate to g the element 
^iUigi in the group ring Z[Z'^] (which corresponds to the polynomial Pg{s,t)). 
This is well-defined (but is natural only up to multiplication by an element of the 
group Z^ ) . We have considered the image of this element under the homomorphism 
Z[Z^] — > Z taking each element of the group Z^ to 1. We can obviously obtain 
more refined estimate by considering either the full group ring, or at least other 
representation of the group ring. 



We collect in this section some elementary identities in the group H and formulae 
for the Heisenberg invariant. Recall that the element [a, /?] is central in the group 



Lemma 4.1. The following identities hold in H . 

(1) a/3a = [a,f3]/3. 

(2) For keZ, af3''a = [a, 

(3) PaP = [a, [3]-^ a. 

(4) ForkeZ (3a'' 13 = [a, /3]-'=a'=. 

(5) For g e H, aga = [a, (3]''^^^ g. 

(6) For g E H, l3g/3 = [a, f]]-''^^) 

Proof. As [a, f3]f3 = a/3a(3(3 = af3a, the first identity follows. As [a,/?] is central, 
the second follows by taking a power. 

Further, as [a,/3]~^ = Papa, we have PaP = [a,P]~^a. Once more we take a 
power to get the next identity. 

Next, if g £ H, by Equation [3l g = [a, /?]''^^^q;°'-^-'/3''^^\ hence we can express 
its conjugate by a as aga = {a[a, P]"''^^ a){aa°'''^^ a){aP''^^^ a). As a commutes 



We can interpret the Heisenberg invariant as 




4. Identities for the Heisenberg invariant 



H. 
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with [a, P] and with powers of a, using the previous identities and that [a, /?] is 
central, aga = [a, [a, = [a,(3]''(9)g as claimed. The proof of 
the remaining identity is similar. □ 

We deduce the effect of canceling a pair of letters in g that are not adjacent on 
the Heisenberg invariant. 

Lemma 4.2. Let gi and g2 be words in H . 

(1) v{giag2a) = 6(32) + '^{9192) 

(2) v{gil3g2l3) = -0(52) + 1^(3152) 

Proof. By Lemma 14711 as [a, /?] is central, giag2a = gi [a, PY'^'^^^92 = [a, P]''''^^^ 9i92- 
From this, it follows that i'{giag2a) = h{g2)i'{gig2)- The other identity is similar. 

□ 



5. Foldings and the Heisenberg invariant 

Consider henceforth a fixed word g = I1I2 ■ ■ - In of length n in a, a, (3 and (3. 
We shall regard the letters as cycHcally ordered, so that In+i = h- The word 
represents an RNA strand. There is an obvious description of RNA secondary 
structures without pseudo- knots in these terms. 

Definition 5.1. A folding (or fold) of the word g is a collection of disjoint pairs 
J- C {(«, j) '■ ^ < i, j < ri, i ^ j} such that 

(1) For {i,j) G either li ^ a and Ij ~ a or li ^ f3 and Ij = /?. 

(2) For pairs (ii, ji) G and (12,^2) & J^, ii < i2 < ji ii < j2 < ji and 
ii > i2> ji =^ ii > j2 > ii- 

We denote the number of pairs in T by \!F\. The folding process for RNA is 
governed by an attempt to maximise \!F\. We define a pair G to be an 

a-pair li U ~ a and a /3-pair if li = (3. Every pair is either an a-pair or a /3-pair. 

The condition on the pairs rules out nesting (i.e., pseudo-knots). For a pair 
G we define the word w{i,j) between the letters as follows. If i < j, 
then w{i,j) = li+ili+2 ■ ■ - Ij-i- If i > j, then w{i,j) = k+ili+2 ■ ■ -IJih ■ ■ -Ij-i- 
Thus, this is the word from the ith letter to the jth letter in the counterclockwise 
direction in the cyclic ordering. The word Dij{g) obtained from g by canceling the 
pair (i, j) G J- is the word with (n — 2) letters obtained by deleting li and Ij. 

Using the conjugacy invariancc of v, we can rephrase Lemma l4.2l in the following 
way. 

Lemma 5.2. For G J^, ifU = a, then v{Dij{g)) = v{g) + b{w{i,j)). Ifk = (3, 
then v{Di,j{g)) = v{g) - a{w{ij)) 

Given a folding T ~ {(ji,ii), (*2,i2), ■ • ■ , (*fe,Jfe)}, we can inductively compute 
v[g) as follows. Let go = 9 and let gi = Di-^j-^{g). Note that induces a folding 
J^i of gi consisting of pairs of letters in J- other than (ii, ji). We can thus continue 
inductively, deleting the pair corresponding to {12^2)- Thus, we get a sequence of 
words 5o,5i> ■ • ■ i 9k- 

Let c(i,j) = b{w{i,j)) iiU = a and c(i,j) = —a(w{i,j)) if U = /3. By inductively 
using Lemma [521 we get the following formula. 
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Lemma 5.3. We have 

k 

1=1 

Proof. First note that by Lemma [??^ i/{g) — h'{gi) + c(ii, ji). We shall iterate this 
process. To do so, observe that to apply Lemma l5.2l to gi, we need to consider the 
word U'i(z2, ^2) in 91 between 12 and j2. By the no nesting condition, this either 
equals ^(12, J2) or differs from this by a pair of letters which are inverses of each 
other. In either case, a{wi{i,j)) = a{w{i,j)) and b{wi{i,j)) = b{w{i,j)). Thus, 
by Lemma [5.21 ^{gi) = 2^(52) + c(i2, ^2)- We can now proceed inductively in this 
fashion to prove the claim. □ 

An a- folding is a folding such that for E k ^ a. A /?- folding is defined 
similarly. Consider a word g such that a{g) = 0. A complete a-folding is an a- 
folding such that if li = a, then for some j 7^ i, G The following 

proposition is straightforward. 

Proposition 5.4. If a{g) = 0, there is a complete a-fold for g. 

Proof. If no letter is a (hence no letter is a as a{g) = 0), we take J-' to be the empty 
set. Otherwise we can find a pair such that w{i,j) does not contain the letters 
a and a. We can proceed inductively by considering the word g' = w{j,i). This 
continues to satisfy the hypothesis and has fewer letters that are a than in g. A 
complete a-fold for g' together with (i, j) gives a complete a-fold for g. □ 

We can analogously define complete /3-folds. If h{g) — there are complete (3- 
folds. Observe that for a complete a-fold JF, \T\ is the number of letters that 
are a. Let np similarly denote the number of letters that are /3. 

Lemma 15.31 takes a particularly simple form for complete a-folds (and /3- folds). 

Lemma 5.5. let g he a word with a{g) = h{g) — and let T he a complete a-fold 
for g. Then using the notation of Lemma \5.9\ 

k 

1=1 

Proof. As is a complete a-folding, the word gk consists of the letters of g that 
are (3 or /3. Thus, g^ ~ ff''^^^ = 1 as b{g) = 0, hence v{gk) = 0. As li — a for all 
pairs (j, j) G c{i, j) ~ b{w(i,j)). Thus the claim follows from Lemma 15.31 □ 

6. Computing the Heisenberg invariant 

The results of the previous section give an elementary formula for the Heisenberg 
invariant from which it can be readily computed. We formulate this below. 

Let g be a word in the free group F. Let ^(O,*) denote the sub-word consisting 
of the i — 1 letters preceding li. 

Theorem 6.1. The Heisenberg invariant is given by 

Y,Hw{0,i))-J2b{wiO,j)) 

li—a h—'^ 
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Proof. The proof consists of inductively applying Lemma 15.21 and Proposition 12. 61 
Consider the letters h-^ , v ■ • h^. of g that are either a or a. Let go ~ g and let 171, 
(72, ■ • . , 3fc be obtained from g by successively deleting the letters /^j, /^j, ... h^. As 
gk is a power of (3, v{gk) — 0. Thus, it suffices to express v{gj) in terms of vigj+i)- 
Wc first express v{gQ) in terms of ^{gi). Suppose li-^ = a. Then using Lemma l5.2l 
and Proposition 12.61 we get 

v{9o) = v{ago) = v{9i) + &(w(0, ii)) 

as (71 is the result of canceling the first letter of ag^ with the letter corresponding 
to . Similarly, if ^ = a, we get 

= v{aga) = v{gi) - &(w(0,ii)) 

as can be readily deduced from Lemma l5.2l 

We now proceed inductively. We can use the same procedure as above to relate 
v{gj) with v[gj+i). As only letters that are a or a are deleted, the numbers 
^(^(O,^^)) are not altered during the inductive construction. Hence we still have 

with the sign determined by whether li. is a or a. Using the formula recursively 
gives the claim. □ 



7. A RIGIDITY THEOREM 

Consider a word g with a{g) ~ h{g) = 0. Let Ua be the number of letters in g 
that are a (and hence the number of letters that are a) and let np be the number 
of letters that are (3. Then the number of letters n of g is 2(na + np). 

Theorem 7.1. We have v{g) < (f )^ with equality if and only ifn^ = np = j and 
g is conjugate to the commutator [a"/*, . 

Proof. By Lemma 15.41 there is a complete a- fold J- for g. By Lemma 15.51 we have 

k 

1=1 

Observe that as w{i,j) is a sub-word of g, b(w{i.j)) < np. The number of terms in 
the above sum is n^. Hence it follows that h'ig) < UaUp. As the geometric mean is 
at most the arithmetic mean and n = 2(n„ + np), it follows that 



1^(9) < n-aUp < 




In case of equality, each of the above inequalities must be an equality. Hence 
as the arithmetic mean equals the geometric mean, na ~ np = n/4. Further, for 
each pair {i,j), b{w{i,j)) = np. it follows that the letters in w{i,j) include all the 
letters in g that are (3 and none of the letters that are ^. It is easy to deduce that 
g is conjugate to [a"/'',/?"/'*]. □ 
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8. The Heisenberg invariant and Local minima 

To motivate our next (and most interesting) result, consider the word [a"/^, 
(or more generally a word of the form [a*^, It is easy to see that any folding 
of this word is either an a-folding or a /3-folding. Hence to pass from a complete 
a-folding to a complete /3-folding at some intermediate stage, the RNA strand must 
be completely unfolded. 

The word [a"/''', is characterised by the Heisenberg invariant. However, 
this situation is too special. We show that if the Heisenberg invariant is close to 
the maximum value in an appropriate sense, then we have foldings Ti and J- 2 so 
that while passing from Ti to J-2^ at some intermediate stage there arc significantly 
fewer pairs than in Ti and J-2- To make this precise we introduce some notation. 

Fix a word g with a{g) = h{g) = and let n, Ua and np be as before. For a 
folding T of g, we define the deficiency p{T) as 



This is the fraction of letters that are not in some pair. The potential energy 
can be assumed to be a monotonically increasing function of the deficiency. 

We say that foldings T and T' are adjacent if their symmetric difference consists 
of a single pair. A path from J- to J-' is a sequence of foldings T = J-q, J^i, 
J'2t ■ -^k = ^' such that Ti is adjacent to Ji+i for < Z < fc. This represents 
a sequence of steps by which an RNA molecule can pass between one folding and 
another. 

We can now state our result. Assume that k is an integer with k/n small. Let J- 
be a complete a-pairing for g and T' a complete /3-pairing. We show that if v{g) 
is close to maximal, then for any path from T to T' ^ some path has much larger 
deficiency than both T and T' . 

Theorem 8.1. Suppose v{g) > (f )^ — k^- Then, 

(1) pi:F)<^ + ^ andp{r)<^ + ^ 

(2) Given any path = T, Ti, ^2,- ■ - Tu = T' from T to T' , for some I, 



l<l<k, p^{Ti) > i-24(f 



2 



Remark 8.2. If we assume that fc/71 is small, then Theorem 18 . 1 1 savs that deficiency 
of the foldings and J-'' are not much more than 1/2, while that of some inter- 
mediate folding is not much less than 1/V2- Thus, the deficiency, and hence the 
potential energy, increases significantly in passing from T to 



Proof of Theorem \8.1[ Assume henceforth that the hypothesis of Theorem 18.11 is 
satisfied. Our first step is as in Theorem 17. II 

Lemma 8.3. We have ria > n/A — k and np > n/4 — k. 

Proof. Let ria = n/A—p with p an integer. Then np — n/A+p. As a{g) = b{g) — 0, 
applying Theorem 15.51 for the folding J-, we get 

k 

1=1 
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As is a complete a- fold, the number of terms of the above sum is n/4 — p. As 
each word w{i,j) is a sub-word of g, b{w{i,j)) < np = n/A + p. Thus, 

, 2 



-(.9) < (4 



^2 



n 



As iy{g) > (-f )^ — k^, < fc^, from which the lemma follows. □ 

As is a complete a-pairing, \J-\ ~ Ua. Hence, as Ua > n/A — fc, an easy 
calculation shows that p{J^) < 5 + ^ as claimed. Similarly, p{J^') < 5 + 

We now turn to the second part of the Theorem. We shall first make some 
observations. Let J-" be a folding of g with pairs involving a and mp pairs 
involving /3. We shall use the formula of Lemma l5.3[ namely 

k 

v{g) = v{gk) + ^c{ii,ji) 
1=1 

Consider first the sum X)f=i J;)- This has two kinds of terms corresponding 
to a-pairs and /3-pairs. We consider these separately. First consider the mc-terms 
corresponding to € T" with li = a. Then c(i,j) ~ b{w(i,j)). 

The key observation is that b{w(i,j)) < np — mp. This is because b{'w(i,j)) is 
the difference between the number of letters in w{i,j) that are (3 and the number 
that are /?. As there is no nesting, if € with Z^/ = P contained in w{i,j), 

then Iji = /3 is also contained in w{i,j). Hence the net contribution of the letters 
of the pair to b{'w{i,j)) is zero. 

Thus, the sum of the terms corresponding to pairs with li = a is at most 
TTT'Oiinp — mp). We have a similar result for the /3-pairs. 

Now consider a path as in the hypothesis of the Theorem. It is easy to see that 
for some Z, the number of a-pairs in Ti is equal to the number of /3-pairs. Denote 
this number by m. We shall find a lower bound for p{J-i). Let the number of 
elements that are not in any pair be d. If gk is as above, then gi^ has d elements. 

Consider the equation 

k 

(4) v{9)=v{9k) + ^ciii,ji) 

1=1 

for J^i. As above, we consider separately the terms corresponding to a-pairs and 
/3-pairs. Let p be as in the proof of Lemma 18.31 We have seen that the total 
contribution of the a-pairs is at most m{np ~ m) = m(n/4 + p — m). By the 
inequality between arithmetic and geometric means, 

1 / Tl \ 

TO(n/4 + p — m) < — y— + pj 

Similarly, the contribution of the /3-terms can be bounded by 

\ / 11 \ 2 
m(n/4 — p — m) < — y— — pj 

Hence we get an upper bound on the sum 

v-^ ' ■ ■ - , 1 / /n 



(5) Z.c(z,,jO< 2 VV4 

1=1 

using p^ < as in the proof of Lemma 
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By hypothesis, i/{g) > (^) — k^. Hence, from Equation |4] and Equation [5] we 
get 

(6) K5.)>^((5)'-3fc^) 

By Theorem 17.11 iy{gk) < (c'/4)^, where d is the number of unpaired letters. 
By the definition of deficiency, d = np{Ti). Thus, from Equation [6] we obtain the 
inequahty 

or 

as claimed. 

□ 



One does not expect in general for the condition a{g) = b{g) = to be satisfied. 
However, one can use Theorem 18.11 provided a(g) and b{g) are small compared 
to n and i'{g) > (^) — k^ with k/n small. To do this, consider the word g' = 
Q,-'»(s)g/3-fe(9). By Proposition [211 iy{g') = i^{g). 

The word g' has length n' = n + a{g) + b{g). The hypothesis ^{g) > [j) — k^ 
can be rephrased as 

2 



/2 



,v2_^^2 I n'iaig) + b{g)) ( a{g) + b{g) 



with 



If fc, 0(17) and b{g) are all small compared to n, then it follows that k' is small 
compared to n' (as well as compared to n) . Hence we can apply Theorem 18.11 in 
this case (to the element g' and deduce for g). 

We can see that generically a{g) and b{g) are comparable to y/n. Namely, a and 
b for a random string can be veiwed as the results of (independent) one-dimensional 
random walks with Ua and np steps. Hence a{g) and b{g) are generically of the 
order of ^/n^ and ^/np, respectively. 

Results analogous to those of this section continue to hold if we modify our model 
so that nearby bases do not pair (as is the case biologically). As further restrictions 
only increase the number of unpaired bases (in particular of intermediate stages), 
it suffices to give lower bounds for the number of paired bases (hence upper bounds 
for the number of unpaired bases) for appropriate a-foldings and /3-foldings. An 
examination of our proof yields such a bound. Namely, if the Hcisenberg invariant 
is close to its maximal value (n/4)^, then (in a sense that can be made precise) for a 
complete a-folding most terms b{w{ii,ji) in Lemma 15.51 must be close to n/4. This 
means that most bonds in the complete a-folding are between bases that are not 
close to each other. Hence we can obtain a lower bound on the number of bonds in 
an a-folding without nearby bases paired. The case of /3-foldings is similar. 
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9. Lower bounds on deficiency 

Wc now turn to another important application of the Hcisenberg invariant. Let 
g he a word in the free group F with a{g) = b{g) = 0. As the Heisenberg invariant 
is a measure of non-triviahty, one expects that if I'ig) is large then the deficiency 
of any folding of 5 is large. We now prove such a result. 

Theorem 9.1. For any folding T of g, if p ^ p{^) 

3p' 4j.(g) 
P- — >^ 

Proof. As before, we use the formula of Lemma 15. 3[ namely 

k 

^{g) = v{9k) + ^c{ii,ji) 
1=1 

Let d = rip{g) be the number of letters that are unpaired in J- (hence the num- 
ber of letters oi gk)- We claim that c{ii,ji) < d/2. This follows if the number of 
unpaired letters of the sub- word w(ii,ji) is at most d/2 as pairs have canceling con- 
tributions to a(M;(zj,j2)) and b {w {i I, ji)) (as in the Proof of Theorem lS.ip . Otherwise 
w{ji,ii) has at most d/2 unpaired letters, from which we can deduce the result as 
a{w{ji,ii)) = -a{w{ii,ji)) and b{w{ji,ii)) = ~b{w{i,j)) since a{g) = b{g) = and 
lij and Iji are a canceling pair. 

The number of pairs in T is {n - d)/2. Further, by TheoremO i^{gk) < (f)^ 
Hence, as d = np{g), we get 

, , fdV d(n~d) n2 / 3p2 

from which the claim follows. □ 
10. The Milnor invariants and other extensions 

We now turn to the higher Milnor invariants. We shall be very sketchy in this 
section as our goal is to indicate further extensions of our methods. 

We begin by recalling the conceptual scheme for defining the Milnor invariants. 
Suppose g is an element of the free group F. The Milnor invariants (in our situation) 
are measures of how different g is from the identity. Further, these are invariant 
under conjugacy and are additive. Thus, they are very well suited for studying 
RNA folding. 

The Milnor invariants are defined in terms of the lower central series. The first 
Milnor invariants correspond to the image of g in the abelianisation F/F2. If this is 
trivial, then g is in F2, and we consider its image in F2/F3. This is the Heisenberg 
invariant. 

We proceed inductively, with the Milnor invariants of order n defined if those 
of lower order vanish. Namely, if the invariants of order less than n vanish, then 
g e Fn. We consider the image of g in Fn/F^+i, which is a finitely generated free 
abelian group. By choosing a basis for this group, we get finitely many integers 
determining the image. Thus, we have an infinite series of invariants. It follows from 
the Magnus expansion homomorphism (which we recall below) that Hi^i = {!}' 
hence all the Milnor invariants of g vanish if and only if g is trivial. 

In our situation, we would like to define the Milnor invariants of order n up to 
an error corresponding to the lower order Milnor invariants, as we did in the case 
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of the Heisenberg invariant. We can do this as we have a given basis for the free 
group F. Namely, we choose and fix sections Sk : F/Fk ^ F, i.e. functions between 
the underlying sets of F/Fk and F (not preserving the algebraic structure) so that 
for the quotient homomorphism qk '■ F F/Fk, wc have qk o : F/F^ F/Fk 
is the identity. Then for an arbitrary g F, g ■ (s„ o qn(g))~^ is in Fn, so we can 
define Milnor invariants by considering the image of g(sn ° <ln{g))~^ in Fn/Fn+i- 
This is what we did in the case of the Heisenberg invariant here, using the section 
(a, 5) i-^- a"" (3^ . As we are not studying the higher invariants in detail here, we do 
not construct explicit sections. 

We remark that in the original topological context, there is no canonical basis 
and hence one cannot make a well-defined choice of section. However, the Milnor 
invariants are defined modulo those of lower order. Such arithmetical considerations 
are not likely to be fruitful in the context of RNA. 

An explicit description of the Milnor invariants, allowing for efficient compu- 
tation, can be given in terms of the so called Magnus expansion homomorphism. 
Consider formal power series with integer coefficients in two non-commuting vari- 
ables X and Y (i.e., where XY is not YX for example). These form a ring Z[[A", Y]]. 
The Magnus expansion homomorphism M is an injcctivc homomorphism from the 
free group F to the multiplicative group of invertible elements of F]]. As is 
well known, the invertible elements of Z[[X,Y]] are the formal power series with 
constant term 1. The Magnus homomorphism is the unique homomorphism such 
that the image of the generators a and /3 satisfy 

Af (q) ^1+X 

M(/3) = 1 + Y 

By properties of formal power series, we have 

M(a) = 1-X + X^-X^ + ... 
M{P) ^1-Y + Y^-Y^ + ... 
In general, for a word g, we multiply the images of the letters of g to obtain 
Mig). 

For g G F, the constant term of M{g) is 1. The coefficients of X and Y are 
a{g) and b(g), and thus determine and are determined by the abelianisation. In 
particular, the coefficients vanish if and only if a(g) = b{g) — 0. 

Suppose a{g) — h{g) = 0. Then it is known that the term of degree two is of the 
form c{XY ~ YX). The Heisenberg invariant is v{g) = c. More generally, g G Fn 
if and only if the terms of degree less than n (except the constant term) vanish. 
As a consequence, we have a well-defined homomorphism of abelian groups from 
Fn/Fn+i to homogeneous polynomials of degree n in the non-commuting variables 
X and Y. This is injective but not surjective. The image consists of so called Lie 
elements, for which an explicit basis can be given. 

Thus, Milnor invariants can be defined as linear combinations of coefficients 
of the polynomial M{g). This expression is not unique as the Lie elements do not 
generate all the homogeneous polynomials of degree n. For instance, the Heisenberg 
invariant can be defined as the coefficient of XY or the negative of the coefficient 
of YX. If g is in i^„, then all these expressions give the same value. For a general 
element 5, the choice of a linear combination is analogous to the choice of a section 
Sn in the earlier description. As in the case of the sections s„, we do not give details 
of the relevant bases and coefficients. 
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We finally turn to the original topologieal context which motivated this work. A 
link L in a smooth 3-dimensional manifold is a disjoint union of smoothly embedded 
circles. An unlink is a link L so that the components of L bound disjoint, smoothly 
embedded discs. There are three well studied relations on links - those of isotopy, 
homotopy and concordance. We consider links up to these relations. 

Firstly, let L' be an unlink with two components. The fundamental group of the 
complement is a free group on two generators, and hence can be identified with F. 
Given a word g in F, consider a three-component link L, with the third component 
a curve 7 representing the element g (up to conjugation) in the complement of L' . 
This is not well-defined, but we may study properties of this link that depend only 
on g. 

The first observation is that the link is defined up to link homotopy, i.e., changing 
the link through a family so that each component is allowed to cross itself but not 
others. For such three-component links, the link is determined by exactly three 
invariants. These correspond to a(g), b{g) and i^ig). We get further invariants by 
observing that we can keep the first two components fixed and only allow 7 to cross 
itself. We expect that it will be fruitful to study the link L up to concordance 
(which implies link homotopy). Link homotopy, isotopy and concordance have 
been extensively studied. We refer to [T]-[TT] for some of the fundamental results 
concerning these. 

11. Concluding remarks 

In this article, we have constructed an easily computed number, the Heiscnberg 
invariant, associated to a strand of RNA, and showed that it is related to a dynam- 
ically important property of Watson-Crick pairing. Values of the invariant large 
enough to yield dynamically interesting results are not generic. However, in bio- 
logical systems it is important to understand molecules with special properties, as 
evolution ensures that living systems are not generic but have compositions giving 
desirable properties. 

We end with a conjecture regarding the Milnor invariants of order one more than 
the Heiscnberg invariant. We have seen that the Heiscnberg invariant being large 
implies that we have minima that are widely separated. We expect that the next 
invariants being large implies that there are minimax paths between minima that 
are widely separated. This is significant in the context of catalysis. 
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